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ABSTRACT 

Hepatitis C virus (HCV) translation initiation is 
directed by an internal ribosome entry site (IRES) 
and regulated by distant regions at the 3'-end of 
the viral genome. Through a combination of 
improved RNA chemical probing methods, SHAPE 
structural analysis and screening of RNA accessibil- 
ity using antisense oligonucleotide microarrays, 
here, we show that HCV IRES folding is fine-tuned 
by the genomic 3'-end. The essential IRES 
subdomains lllb and Mid, and domain IV, adopted 
a different conformation in the presence of the 
c/s-acting replication element and/or the 3'- 
untranslatable region compared to that taken up in 
their absence. Importantly, many of the observed 
changes involved significant decreases in the 
dimethyl sulfate or /V-methyl-isatoic anhydride re- 
activity profiles at subdomains lllb and Mid, while 
domain IV appeared as a more flexible element. 
These observations were additionally confirmed 
in a replication-competent RNA molecule. 
Significantly, protein factors are not required for 
these conformational differences to be made 
manifest. Our results suggest that a complex, 
direct and long-distance RNA-RNA interaction 
network plays an important role in the regulation 
of HCV translation and replication, as well as in the 
switching between different steps of the viral cycle. 

INTRODUCTION 

The hepatitis C virus (HCV) genome is a 9.6-kb-long, 
positive, single-stranded RNA molecule (1,2). It consists 



of a single open reading frame (ORF) flanked by untrans- 
latable regions (UTRs) that contain structural elements 
essential to the execution of the viral cycle (3-9). Viral 
translation initiation is mediated by a highly structured 
element mainly located at the 5'-UTR, which acts as an 
internal ribosome entry site (IRES) (3,4) (Figure lA). 
IRES-dependent translation differs greatly from the mech- 
anism used by most cellular mRNAs, operating by the 
direct recruitment of the 40S ribosomal subunit (10,11), 
which promotes the formation of the 48S particle via the 
further recruitment of eIF3 and the elF2/GTP/Met- 
tRNA ternary complex. The configuration of the 48S 
particle permits the appropriate positioning of the 
codon-anticodon pairs in the P site during protein synthe- 
sis initiation (12-14). This apparently simple process is 
guided by a subset of regulatory structural RNA 
elements included in the IRES element. Thus, the con- 
formation of the IRES plays a crucial role in viral 
protein synthesis (14-18). 

Under physiological conditions, the minimum HCV 
IRES element required for translation initiation folds 
into a well-defined structure composed of two major 
domains (19) (II and III) and a short stem-loop 
(domain IV) containing the initiation codon (20) 
(Figure lA). Domains II and III are aligned at both 
sides of a complex double pseudoknot structure allowing 
the correct positioning of the start codon (14). 
Importantly, many of the crucial structural regions 
required for IRES-dependent translation are located in 
the highly branched domain III (21). The latter is 
organized into three- and four-way junctions defined by 
stem-loop motifs (designated subdomains Ilia to Illf) 
that mark the anchoring sites for protein factors. Thus, 
the Illabc junction directs the interaction with eIF3 
(13,22), whereas subdomains IIIc to Illf cooperate in the 
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Figure 1. Secondary structure of the 5'- and 3'-ends of the HCV genome. (A) Sequence and secondary structure of the 5'- and 3'-regions of the HCV 
genome. The 5'-UTR plus domains V and VI located inside the HCV coding sequence are included. The minimum region for IRES activity is boxed. 
The 3'-end of the viral genomic RNA is organized into two structural elements: the CRE region and the 3' X-tail, separated by a hypervariable 
sequence (HV) and a polyU/UC stretch (also considered inside the 3'-UTR). Numbers refer to the nucleotide positions of the HCV Conl isolate 
(GenBank accession number AJ238799). Start and stop translation codons placed at positions 342 and 9371, respectively, are shown in bold. 
Pseudoknot elements are marked as PKl and PK2. (B) Diagram of the HCV transcripts harboring the IRES-FLuc fusion attached to different 
3' viral genomic domains. 



recruitment of the ribosomal subunit 40S (13,23). 
Importantly, subdomain llld has been described as the 
core 40S binding center (24-26). Its structure is that of a 
dynamic stem-loop with an internal E loop motif and an 
apical loop with typical U-turn geometry (15,27). The con- 
servation of aU these structural motifs in the IRES element 
is critical for its proper function (21). 

The presence of structural elements at the 3'-end of the 
viral genome may also modulate the initiation and 



elongation steps involved in HCV translation (28-33). 
This might be mediated via the HCV genomic RNA, 
assuming a circular topology resembling the closed-loop 
structure adopted by cellular cap-mRNAs. Such circular 
topologies were initially thought to be strictly dependent 
on the recruitment of protein factors able to simultan- 
eously bind to the 5'- and 3'-UTRs of the genomic HCV 
RNA (28-31,34-36). Importantly, the existence of a 
direct, long-range RNA-RNA interaction has recently 
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been proposed. This involves subdomain Illd of the IRES 
element and the essential stem-loop 5BSL3.2 of the CRE 
(cw-acting replication element) region at the 3'-end of the 
viral protein coding sequence (37) (Figure lA). This cir- 
cularization mechanism resembles that evolved by other 
closely related RNA viruses of the family Flaviviridae (38). 
End-to-end communication mediated by direct RNA- 
RNA interactions would hkely alter the conformation of 
the HCV IRES, allowing a regulatory mechanism for viral 
protein synthesis dependent on RNA high-order structure. 
The effect of the 3' genomic elements on IRES folding is 
required to gain deeper insights into the mechanism of 
HCV translation and its regulation. 

The present work provides evidence that IRES con- 
formation is fine-tuned by the 3'-end of the HCV 
genome. 3'-End-mediated IRES structural changes are 
mainly detected in the essential subdomains Illb and 
Illd, and in domain IV, suggesting this interaction to be 
important in the recruitment of the 40S particle and eIF3. 
These results are in good agreement with our previous 
data demonstrating long-range contacts between the 
apical loop of subdomain Illd and the internal loop of 
5BSL3.2 of CRE region, and highhghting the relevance 
of both the CRE and the 3'-UTR for the proper IRES 
activity (33). To our knowledge, this is the first report 
describing the requirement of two 3' genomic elements, 
CRE and 3'-UTR, for the fine-tuning of the structural 
and functional organization of the HCV IRES element. 
Further, this interaction occurs in the absence of protein 
factors. The obtained results point to local conformational 
rearrangements as key events in the regulation of viral 
translation and replication, as well as in switching 
between different steps of the HCV fife cycle. 

MATERIALS AND METHODS 

DNA templates and RNA synthesis 

DNA templates for the transcription of RNA molecules I, 
lU, IC and ICU were obtained by amplification as previ- 
ously described (33) to get the precise 3'-termini. Briefly, 
molecules T7pIU and TTpICU encoding the variants lU 
and ICU were amplified from their respective plasmid 
constructs pGL-IU and pGL-ICU, using the primers 
S'TVpHCV and 3'HCV. DNA templates containing 
either the I or the IC constructs (T7pl and TVpIC) were 
PCR amplified from the plasmid pGL-ICU: primers 
S'TVpHCV and asFLuc (5'-TCTAGAATTACACGGCG 
ATCTTTCCGC-3') were used for the amplification of the 
variant T7pl, whereas TVpIC was synthesized by using the 
oligonucleotides S'TVpHCV and asHCV-9321 (33). 

The plasmid encoding the S'HCV-691 transcript has 
been previously described (39). DNA templates coding 
for constructs I + U, I + C and I -I- CU were generated by 
site-directed mutagenesis using the Phusion site-directed 
mutagenesis kit (Finnzymes) from the template vector 
pGL-ICU (33), employing the following primers: 
HCV-9 1 8 1 (S'-GGGCAGTAAGGACCAAGCTCAAA 
-3') and asHCV-698 + Hindlll for pGLI + C and 
pGLI + CU plasmids, and HCV-9383 and asHCV- 
698 + Hindlll for pGLI + U construct. These vectors 



were used as DNA templates for PCR amplification of 
the molecules TVpI+U, TVpI + C and TVpI + CU from 
which the respective RNA variants (I + C, I + U and 
I + CU) were in vitro synthesized (see below). Primers 
S'TVpHCV and 3'HCV were used for the amplification 
of T7pl + U and T7pI + CU, whereas S'TVpHCV and 
asHCV-9321 allowed the production of TVpI + C. The in- 
tegrity of each construction was checked by DNA 
sequencing. 

The DNA vector containing the replication-competent 
construct, pFK-l389FLucNS3-3'ET (40), was kindly 
provided by Dr R. Bartenschlager. The DNA template 
for the generation of the replicative RNA transcript was 
obtained as previously described (40). 

All RNAs were synthesized using the RiboMAX'^'^-TV 
large scale RNA production system (Promega), following 
the manufacturer's instructions. The resulting transcripts 
were purified by phenol extraction and non-incorporated 
nucleotides were removed by two consecutive steps of size 
exclusion chromatography (Sephadex G-2S, GE 
Healthcare). The amount of RNA was determined by 
UV spectrophotometry (A260) and the extent of protein 
and carbohydrate/phenolic contaminations was assessed 
by A260/A280 and A260/A230 ratios, respectively. The integ- 
rity of the RNA molecules was determined by denaturing 
agarose-formaldehyde gel electrophoresis. 

Microarray design, RNA hybridization and data analysis 

Antisense DNA oligonucleotides complementary to IRES 
domains III and IV of HCV (genotype lb) were designed 
to specifically hybridize with sequences of 14 consecutive 
nucleotides of the viral genome, overlapping by seven 
nucleobases with the adjacent ohgos. Each 
HPLC-purified oHgonucleotide (IBA GmbH) was 
composed of a C6-amino hnker motif at its S'-end, 
followed by either (TCC)5 or CCCTCCCTCCCTCCC 
tracks (designed to avoid self-annealing within the oligo) 
as spacer regions, and the specific antisense sequence at 
the 3'-side of the ohgo (Supplementary Table SI). Their 
names correspond to the nucleotide on the IRES sequence 
complementary to the 3'-end of the oligo. Two different 
oligonucleotides with unrelated sequences (belonging to 
either the S'-genomic region of foot-and-mouth disease 
virus or the pol gene of human immunodeficiency virus 
type 1) were included as negative controls. The 3S oligo- 
nucleotides were diluted in Ix spotting solution 
(Telechem-Arrayit) and spotted onto epoxi-modified 
glass shdes (Telechem-Arrayit). Two concentrations (S 
and 20|iM) of each oligonucleotide probe were printed 
in triplicate ISOiim 0 spots, with a distance of 300 |im 
between groups of triplicate spots. Microarrays containing 
two duplicated grids of 210 spots were printed using a 
GSM 41V DNA arrayer (Affymetrix). 

The HCV RNAs were fluorescently labeled with Alexa 
64V (Invitrogen) as previously described (41). Briefly, 1 |.ig 
of RNA was incubated in 30^1 of labehng buffer for 
ISmin at 4°C, prior to the addition of 2|il of the fluores- 
cent dye, and the final mixture was incubated for lOmin at 
90°C. The labehng reaction was stopped by transferring 
the sample to ice for lOmin and purified by Microcon 
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(Millipore). The final volume was adjusted to 5-10 |il in a 
speed vacuum. 

The printed microarrays were pre-treated with 2x SSC 
and 0.1% A^-lauril sarcosine for 2min at room tempera- 
ture (RT). The unbound DNA oligos and the excess of 
spotting solution were washed by an additional incubation 
in 2x SSC for 2min. Then, printed ohgos were denatured 
by baking the microarrays for 2min in boiling milh-Q 
water, coohng for 10 s at RT and immediately fixed in 
ice-cold 100% ethanol as described (42). The microarrays 
were prehybridized in 6x SSC, 0.5% SDS and lOng/nl 
BSA for 45min at 42°C. A total of 300 ng of the fluores- 
cent transcripts were denatured by heating at 95°C for 
3min and subsequent coohng at 4°C prior to incubation 
with the slides. Hybridization was performed under 
non-denaturing conditions (100 mM HEPES pH 8.0, 
lOOmM NaCl, ImM MgCy for 2h at 37°C. Slides 
were washed twice in 1 x washing buffer (20 mM HEPES 
pH 8.0, 50 mM NaCl, ImM MgCh) in the presence of 
0.2% SDS for 15min at RT and then once again in 
washing buffer alone for 15min at RT. After a final 
rinse with water, the shdes were dried by centrifugation 
for 1 niin at 500 g and immediately scanned using a 
Genepix 4100 scanner. Data were retrieved using 
Genepix Pro 6.0 software. 

The differential capacity of each RNA transcript for 
hybridizing to the microarrays was measured in three in- 
dependent experiments. In aU cases, the raw fluorescence 
signal was corrected by subtraction of the local back- 
ground. To compare the signal among the three independ- 
ent hybridization assays, the mean fluorescence signal 
yielded by the 20|iM spots was normalized against the 
mean fluorescence of oligonucleotides 113 and 176 
(Supplementary Table SI). The signal range that defines 
the accessibility level was defined as foUows: the interquar- 
tile range of the hybridization values was calculated to 
identify the outliers, i.e. the positions showing the 
highest relative accessibility. The lowest of these outliers 
was estabhshed as the upper limit of the range. Quartile 3 
defined the lower limit of the range. 

Dimethyl sulfate probing 

A total of 1 pmol of purified HCV RNA (~ 1 ^g) was 
denatured for 3min at 95°C in folding buffer (50 mM 
HEPES pH 8.0, lOOmM NaCl,lmM MgCy before 
transferring to ice for 15min. The RNA was then 
incubated for 5min at 37°C. Reactions were initiated by 
the addition of 1 |il of freshly diluted dimethyl sulfate 
(DMS) in ethanol (1:5). Chemical probing proceeded for 
1 min at 37°C in the presence of tRNA (1 |.ig) and was 
stopped by the addition of 300 mM sodium acetate pH 
5.2. Untreated DMS reactions were performed in 
parallel. The RNA was ethanol-precipitated, washed 
three times with 80% ethanol and subjected to primer ex- 
tension for the detection of the modified A and C residues. 
Fluorescently labeled DNA ohgonucleotides (Applied 
Biosystems) used for the determination of DMS and 
A^-methyl-isatoic anhydride (NMIA) reactivity were 
purified using high-resolution denaturing polyacrylamide 
gels. Two 5'-end fluorescently labeled primers were used to 



map domains III and IV of the IRES element: asHCV-372 
(5'-TTTTTCTTTGAGGTTTAGGATTCGTGCT-3') 
and asHCV-539 (5'-GGGGATAGGTTGTCGCCTT-3'). 
Briefly, 2 pmol of gel-purified primer were hybridized 
with the total processed RNA by incubation for 2 min at 
95°C, foUowed by slow coohng to 52°C. Extension reac- 
tions were performed for 30 min at 52°C in a 20-(il 
reaction volume with reverse transcriptase buffer, 
0.5 mM dNTPs and 100 U Superscript III RT 
(Invitrogen). NED fluorophore was used for mapping 
the untreated probes; VIC was employed to read the 
DMS assays. RNA sequencing reactions were performed 
under identical conditions with the FAM and PET 
fluorescently labeled primers. A total of 2 |il of the result- 
ing cDNA samples were purified using the BigDye® 
XTerminator™ Purification kit (Applied Biosystems) 
and resolved in an Applied Biosystems 3130x1 Genetic 
Analyzer capfllary electrophoresis DNA sequencer, as pre- 
viously described (43). Electropherograms were analyzed 
using ShapeFinder software (44). Normalized DMS re- 
activity values for each nucleotide position were 
obtained by dividing each value by the average intensity 
of the 10% most reactive residues, after excluding outhers 
calculated by box plot analysis (43). This novel protocol 
afforded a DMS probing method using the technology 
developed for SHAPE chemistry. 

SHAPE analysis 

Treatment with NMIA was performed essentially as pre- 
viously reported (43). Briefly, 1 pmol of RNA was 
denatured as described earlier under the same ionic con- 
ditions. 2'-Acylation was initiated by the addition of 6mM 
of NMIA in dimethyl-sulfoxide. Reactions proceeded for 
3 min at 37°C. The formation of 2'-C>-adducts was stopped 
by ethanol precipitation. The RNA was subsequently 
washed three times with 80% ethanol and subjected to 
primer extension as described earlier. Normalized NMIA 
reactivity values for each nucleotide position were 
calculated as indicated for DMS probing. 

Statistical analysis 

All accessibihty and reactivity data are presented as 
mean ± SD. Data points were compared using the 
unpaired two-tailed Mann-Whitney test (45), as indicated 
in the figure legends. Significance was set at _P<0.05. 
/"-values are included in the corresponding 
Supplementary Tables and provided throughout the text 
when appropriate. 

RESULTS 

Microarray analysis detected RNA accessibility variations 
in the IRES element mediated by the 3'-end of the HCV 
genome 

A direct RNA-RNA interaction involving subdomain 
Illd of the HCV IRES element and the stem-loop 
5BSL3.2 located at the CRE region was initially 
demonstrated in trans (37) and its regulatory role in the 
HCV IRES function subsequently proven (33). Based on 
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these observations, it was postulated that the IRES con- 
formation might rely on long-distance RNA-RNA inter- 
action in cis with the 3'-end of the viral genome. To test 
this hypothesis, IRES accessibility in the presence of dif- 
ferent elements belonging to the 3'-end of the viral genome 
was tested by hybridizing (under non-denaturing condi- 
tions) each labeled transcript to a customized panel of 
overlapping 14-mer antisense DNA ohgonucleotides (see 
'Materials and Methods' section and Supplementary 
Table SI) complementary to domains III and IV of the 
IRES element. The use of DNA microarrays to analyze 
RNA structure is based on the strong correlation between 
the native structure of a target RNA and its ability to 
differentially hybridize with a panel of antisense ohgo- 
nucleotides printed onto the microarray surface. This 
technology has been successfully used for detecting struc- 
tural rearrangements affecting nucleotide stretches of 
target subgenomic RNA molecules from HCV (42), 
foot-and-mouth disease virus (41,46) and human immuno- 
deficiency virus (47). 

Microarray analyses were performed with previously 
described chimeric RNA molecules (33) encompassing 
the IRES element fused to the reporter gene flue (termed 
molecule I) plus different 3'-end regions (Figure IB). Thus, 
the variant IC carried the CRE element, the construct lU 
contained the 3'-UTR and the whole 3'-end of the HCV 
genome was included in the ICU molecule. The use of 
PCR-generated DNA templates assured the presence of 
the precise 3'-ends. The RNA constructs were subse- 
quently transcribed and fluorescently labeled, renatured 
and hybridized on the microarray under native conditions 
(19) as previously described (41,42). The fluorescence 
signal for each oligonucleotide was normalized with 
respect to the average fluorescence detected for oligos 
113 and 176 (see 'Materials and Methods' section). The 
mean and SD (from at least three independent assays) of 
the fluorescence signal was estimated for each oligonucleo- 
tide (Figure 2A). 

An initial overview of the hybridization profile for the 
molecule I reported an expected low-accessibility pattern, 
many regions showing signals close to the background 
levels (Figure 2A). Nevertheless, well-defined nucleotide 
stretches were detected to display either medium or high 
accessibihty. This suggested that the IRES folding is tight 
enough to preserve a unique, stable conformation. The 
addition of the whole 3'-end of the HCV genome 
(molecule ICU) revealed specific variations in the hybrid- 
ization pattern (Figure 2A). These differences were 
analyzed by a statistical non-parametric test to further 
evaluate their significance. It should be noted that the 
low accessibility profile shown by the molecule I 
prompted us to consider even minor changes in the 
signal, as long as the statistical significance of the obser- 
vation can be assessed. Moreover, since IRES folding is 
essentially autonomous (19), we paid special attention to 
those minor variations, which could reflect the conform- 
ational fine-tuning of functionally relevant domains. Thus, 
significant increases in the fluorescent signal of molecule 
ICU were detected at oligonucleotides 183 (P = 0.050) 
and 211 (P = 0.050), which mapped to subdomain Illb 
(Figure 2B). Despite these relative increases, the apical 



region of subdomain Illb remained fairly inaccessible 
(oligos 183-197) or only slightly accessible (204 and 211) 
in transcript ICU (Figure 2 and Supplementary Figure 
SI). Two other significant increases in molecule ICU 
with respect to I were found at ohgonucleotides 295 
{P = 0.050) and 316 (P = 0.050), which recognize the es- 
sential pseudoknots PKl and PK2 (subdomains Illef). 
This reflects the relative instability of these tertiary inter- 
actions in different structural environments when 
competing with longer antisense oligonucleotides. No add- 
itional changes in the hybridization profile induced by the 
CRE element or the 3'-UTR on their own (transcripts IC 
and lU, respectively) were preserved in the molecule ICU 
(Figure 2 and Supplementary Figure SI). Thus, the CRE 
region induced a significant reduction in the hybridization 
signal at ohgos 190, 204, 288, 330 and 337, which was 
annulled in the presence of the 3'-UTR (Figure 2 and 
Supplementary Figure SI). 

These data suggest that, independently of protein 
factors, the 3D folding adopted by the whole 3'-end of 
the HCV genome may be required for the acquisition of 
the functional conformation of the IRES subdomains Illb 
and Illd, and domain IV, at its 5'-end. 

DMS treatment showed the HCV IRES structure to be 
dependent on the 3' -end of the viral genome 

The antisense oligonucleotide microarray informs about 
the accessibihty of tracks of 14 nt of the HCV IRES 
element, thus providing a wide overview of the global con- 
formation of the molecule. However, it cannot reveal in- 
dividual nucleotides eventually involved in long-range 
RNA-RNA interactions. To solve such a limitation, 
DMS chemical probing was performed to map the 
precise IRES nucleotides affected by the presence of the 
CRE and 3'-UTR elements. 

The RNA transcripts previously used in the microarray 
assays (I, lU, IC and ICU) were denatured and renatured 
under low-magnesium conditions to achieve a stabilized 
native structure (19) as described earlier and then sub- 
jected to chemical modification with DMS (see 
'Materials and Methods' section). Reactive residues were 
detected by a high-throughput approach (43,48,49) in 
which the primer extension reactions were performed 
with two different sets of fluorescent primers (see 
'Materials and Methods' section). The reverse transcrip- 
tion elongation products were subsequently resolved by 
capillary electrophoresis and analyzed using 
ShapeFinder software (43). To our knowledge, this is the 
first report describing the use of that technology, origin- 
ally developed for SHAPE chemistry (43), for DMS 
chemical probing. Unpaired A and C residues present in 
domains III and IV of the IRES element were studied 
using this strategy. Occasionally, reverse transcriptase 
stop points were also detected 5' of highly reactive G 
residues (50,51). The relative reactivity at each nucleotide 
position was normalized as previously described (43), 
dividing the absolute reactivity value by the average of 
the 10% most reactive residues after excluding outliers. 
Mean and SD values (derived from at least four independ- 
ent experiments) were calculated and represented in a 
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(P<0.05) in fluorescence intensity for constructs IC and ICU with respect to transcript I are represented quahtatively as gain (+) or loss (— ) of 
accessibility. Functional IRES subdomains are shown. There were no significant differences in fluorescence intensity for construction lU with respect 
to transcript I. 



histogram to provide the DMS probing profile of domains 
III and IV (Supplementary Figure S2). 

The results for construct I corresponded perfectly with 
the secondary structure proposed for domains III and IV 
of the IRES element (15,17-19,27,52) (Figure 3A and 
Supplementary Figure S2A). Similar analyses of the lU, 



IC and ICU transcripts allowed for evaluation of potential 
conformational changes, with respect to molecule I, 
mediated by the CRE and/or the 3'-UTR elements. The 
resulting reactivity patterns suggested the existence of dif- 
ferences in the DMS map (Figure 3A and Supplementary 
Figure S2A). Subsequent statistical analysis identified 
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Figure 3. Differential DMS sensitivity of the HCV IRES element induced by the CRE and the 3'-UTR motifs of the viral genome. (A) The graph 
shows a comparative analysis with the values of DMS reactivity at each nucleotide position for the four constructs under study (I, lU, IC and ICU). 
Noisy residues in the apical loop of subdomain Illb are shadowed. Residues with DMS reactivity values <0.3 were classified as non-reactive 
positions. Lower panel: histogram representation showing a detailed view of the DMS reactivity pattern for subdomains Illb (nucleotides 
180-219) and Illd (nucleotides 253-280). Values are the mean of four independent experiments ± SD. (B) Significant variations (/'<0.05) in 
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(independently of their degree of variation) the specific 
and significant variations in DMS reactivity (Figure 3B, 
Supplementary Figure S2B and Supplementary Table S2 
for P values associated with each nucleotide position). 
Interestingly, the combined presence of the CRE and the 
3'-UTR elements (molecule ICU) specifically induced sig- 
nificant reductions in DMS reactivity at nucleotides A275 
(P = 0.020) and A276 (P = 0.020), located in the E loop 
motif of subdomain Illd (Figures 3 and 5; Supplementary 
Figure S2). This is in good agreement with previous results 
describing the existence of an interaction involving 
subdomain Illd of the IRES and the 3'-end of the HCV 
genome (37). 

In concordance with the microarray results, nucleotide 
A215 showed higher DMS reactivity in molecule ICU 
{P = 0.050), confirming the influence of the whole 3'-end 
in the folding of certain regions within the IRES element 
(Figure 5). The influence of either CRE or 3'-UTR 
elements on other positions of subdomain III (Figure 3 
and Supplementary Figure S2B), was assessed by 
comparing the accessibihty profiles for IC or lU molecules 
versus I. A significant increase in DMS sensitivity mediated 
by the CRE element was found at nucleotide G190 
(P = 0.025), which was preserved in the ICU construct. 
Other changes in the DMS reactivity profile observed 
for the variants IC and lU were annulled when the whole 
3'-end was used in the ICU construct (Figure 3 and 
Supplementary Figure S2). Importantly, most of the 
specific and significant reactivity changes detected were 
also found in the absence of the flue gene (Supplementary 
Figure S3A and S3B; Supplementary Table S3), ruling out 
the possibility of an indirect effect of the luciferase coding 
sequence on IRES folding. 

Together, these data show that the reactivity of certain 
specific residues in the HCV IRES element, including 
A275 and A276 of subdomain Illd, is affected by the 
presence of different 3'-tails, suggesting a direct influence 
of the CRE and 3'-UTR elements on IRES structure. 

SHAPE analysis detected variations in HCV IRES 
conformation mediated by the 3' -end of the viral genome 

Antisense ohgonucleotide microarray analysis and DMS 
probing can provide accurate information on the second- 
ary and, eventually, higher-order structure of a target 
RNA molecule. However, in many cases, the folding 
features of a target molecule do not necessarily depend 
on base-pair interactions, but on the conformation of the 
ribose-phosphate backbone. This may ultimately define the 
flexibihty and overall geometry of an RNA molecule. 
Selective 2'-hydroxyl acylation and primer extension 
(SHAPE) (53,54) analysis was therefore performed on 
transcripts I, lU, IC and ICU using NMIA as the acylating 
reagent. The formation of 2'-(9-adducts was monitored by 
primer extension with two sets of fluorescently labeled 
DNA ohgonucleotides and analyzed using ShapeFinder 
software as described earher (see also 'Materials and 
Methods' section). The mean relative reactivity and SD 
(for at least five independent experiments) was calculated 
for each nucleotide position. This allowed the construction 



of the corresponding SHAPE profile of the studied region 
(Figure 4A and Supplementary Figure S4A). 

The reactivity pattern obtained for transcript I showed 
no significant differences with respect to that previously 
reported (55) (Figure 4A). This reinforced the idea that the 
HCV IRES element adopts a preferential, stable conform- 
ation in native conditions. Interestingly, the comparative 
representation revealed remarkable differences in NMIA 
sensitivity for the transcripts encompassing different 
regions of the 3'-end with respect to molecule I (Figure 
4A and Supplementary Figure S4A). Statistical analyses 
filtered the large number of nucleotides differentially 
modified to reveal specific and significant differences in 
the reactivity profile for the molecule ICU compared to 
variant I in precise nucleotide positions (Figure 4B and 
Supplementary Figure S4B; see also Supplementary 
Table S4 for a detailed summary of the P values at each 
reactive residue). As detected in DMS probing and micro- 
array assays (Figures 2 and 3), modifications able to dras- 
tically influence the secondary structure were not 
expected. Hence, the study was focused on statistically 
significant reactivity variations affecting to nucleotides 
extended stretch at functionally relevant domains. Most 
of these differences affected structural elements essential 
for the IRES function (Figures 4 and 5). Interestingly, a 
significant reduction in NMIA sensitivity was observed for 
residues U265 (P = 0.034), G266 (P = 0.050) and G267 
(P = 0.048), located in the apical loop of subdomain 
Illd. Again, this firmly supports the previous hypothesis 
that a long-range RNA-RNA interaction can be estab- 
hshed between subdomains Illd and the CRE element 
(37). SHAPE analysis also revealed significant increases 
(P< 0.050) in the flexibility of the residues flanking 
the translation initiation codon in domain IV (Figures 4 
and 5; Supplementary Table S4). In addition, several nu- 
cleotides throughout the stem of subdomain Illb showed 
differential NMIA sensitivity compared to transcript I, 
suggesting alterations in its local conformation. 
Deviations (mainly reductions) in the reactivity pattern 
related to single nucleotides were also significant in sub- 
domains Ilia, IIIc and Ille, suggesting the existence of 
sporadic folding reorganization processes upon the inter- 
action of these subdomains with the 3'-end of the viral 
genome (Figures 4 and 5; Supplementary Table S4). 

To analyze the individual role played by the CRE and 3'- 
UTR elements present in the ICU molecule, the IC and lU 
constructs were compared to molecule I using SHAPE 
chemistry. The presence of each of the 3' elements also af- 
fected the reactivity pattern at single nucleotide positions, 
most of these modifications being preserved in the ICU 
molecule (Figures 4 and 5; Supplementary Figure S4). 
These results support the notion of a specific role for 
these 3' elements in the direct regulation of IRES conform- 
ation. Thus, the presence of the CRE region promoted re- 
markable and significant variations in NMIA-reactivity at 
nucleotides located throughout the stem of subdomain Illb 
and domain IV (Figures 4 and 5; Supplementary Table S4) 
in variants IC and ICU. Slight susceptibility to the presence 
of the independent 3'-UTR element (molecule lU) was also 
noted at single nucleotide positions throughout domains 
III and IV (Figures 4 and 5; Supplementary Table S4). 
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All these reactivity modifications involving 1 nt are likely 
related to local conformational changes produced in the 
IRES element triggered by the presence of the different 3' 
elements. Further modifications in the IRES SHAPE 
pattern promoted by the independent CRE and 3'-UTR 
elements were reversed when both domains were 
combined in the ICU transcript, as previously noted in 
the DMS analysis (Figure 4). Moreover, the reductions in 



reactivity detected for the variant ICU at subdomain Illd 
represent specific events mediated by the presence of the 
3'-tail domains, as evidenced by the fact that most of 
the reactivity changes were preserved in the constructs 
lacking the flue gene (Supplementary Figure S3C and 
Supplementary Table S4). 

These results support the idea that conformational 
rearrangements affecting essential regions of the IRES 



Nucleic Acids Research, 2012, Vol. 40, No. 22 11707 



element (such as subdomains Illb and Illd, as well as 
domain IV) are strongly mediated by structural domains 
placed at the 3'-end of the HCV genome. Therefore, such 
long-range RNA-RNA interactions may contribute to the 
acquisition of a fully functional IRES structure. 

IRES folding in a replication-competent viral RNA 

The structural analyses described earlier were performed 
on IRES-dependent translationally active RNA con- 
structs. DMS chemical probing and SHAPE structural 
analysis were then performed using a rephcation- 
competent viral RNA molecule. For this, a subgenomic 
HCV replicon RNA was obtained by in vitro transcription 
from the plasmid pFK-laggFLucNSS-S'ET (40). This con- 
struct (termed Rep) carries the HCV IRES derived from 
genotype lb followed by the flue gene, the IRES element 
belonging to the encephalomyocarditis virus (EMCV) and 
the coding sequence for the non-structural HCV proteins 
plus the HCV 3'-UTR (Figure 6A). 

Chemical probing with DMS was performed on Rep 
under the same experimental conditions as those described 
earlier. The mean of the relative reactivity values from 
four independent assays was estimated at each nucleotide 
position to evaluate the accessibihty pattern (Figure 6B). 
Compared to construct I, notable differences were 
measured in the DMS reactivity profile affecting residues 
located in the E loop of subdomain Illd. Reduction in 
accessibility reached statistical significance at the tri- 
nucleotide A274-A276 of subdomain Illd (A274, 
P = 0.011; A275, 7^ = 0.011; A276, P = 0.022; Figures 
6B and 6C, as well as Supplementary Figure S5A and 
S5B; see Supplementary Table S2 for a complete hst of 
the P values corresponding to those reactive positions). 
This is in good agreement with DMS probing data previ- 
ously obtained for the ICU transcript. The DMS reactivity 
profile of the central stem of domain III was also affected 
in Rep (A243, P = 0.008; A244, P = 0.020; Figures 6B 
and 6C; Supplementary Table S2) compared to construct 
I. Accordingly, SHAPE analysis also revealed differential 
reactivity for these nucleotides in the ICU transcript 
(Figure 5). Compared to construct I; any other 1-nt modi- 
fications in the DMS sensitivity of Rep was hardly 
observed (Figure 6B and 6C; Supplementary Figure S5A 
and S5B). 

The study of the IRES conformation in the 
replication-competent RNA was complemented with 
SHAPE analyses. The comparison of the IRES reactivity 
profile in the construct Rep with that of variant I showed 
several changes analogous to those previously observed 
for the ICU transcript (Figure 7A and Supplementary 
Figure S5C), thus reinforcing the consistency of our 
findings. These changes consisted in significant reductions 
in NMIA reactivity in the apical loop of subdomain Illd 
(G263, P = 0.034; U265, P = 0.025; G267, P = 0.034; 
G268, P = 0.050 and U269, P = 0.034; Figures 7B 
and 8; Supplementary Figure S5C and S5D). This is a 
further proof supporting a long-range interaction 
between IRES subdomain Illd and the 3'-end of the 
HCV genome (37). Local rearrangements at the stem of 
subdomain Illb were also inferred from the SHAPE 



reactivity pattern, in agreement with those detected for 
the ICU molecule (Figures 7 and 8; Supplementary 
Figure S5C and S5D; see also Supplementary Table S4 
for P values at each reactive nucleotide). Additional vari- 
ations in the SHAPE pattern were also detected at differ- 
ent nucleotides, suggesting that other RNA domains of 
the HCV genome play a role in the overall conformation 
of the IRES element. Among them, a relevant change in 
reactivity to chemical modification was detected at the 
short basal stem of domain III: compared to molecule I, 
the residues C139-U141 of Rep were more exposed to 
both NMIA and DMS probes (Figures 7 and 8). 

Together, the present data support the existence of a 
complex, long-distance RNA-RNA interaction network 
in the HCV genome, which fine-tunes the 3D structure 
of the active IRES element. 



DISCUSSION 

HCV genome contains well-defined, phylogenetically 
conserved structural elements that mainly localize at its 
5'- and 3'-termini and extend to the ORE (56-60). These 
regions are essential for the consecution of two key, 
closely related processes for the viral cycle: translation 
and replication. In particular, HCV translation initiation 
is dependent on an IRES element located at the 5'-end of 
the viral genome and influenced by the presence of RNA 
domains placed at the 3'-end of the genome. Both ends can 
participate in the acquisition of a circular conformation of 
the viral genome, which is typically associated to the inter- 
action with cellular and viral protein factors (61). We 
recently reported that long-distant RNA-RNA inter- 
actions between both ends of the HCV genome may 
help form this circular topology, with important conse- 
quences for IRES function (37). The results of the 
present work show that the IRES conformation is 
fine-tuned by the presence of different RNA domains at 
the 3'-end, independently of protein factors. Our findings 
also suggest that the overaU effect of the 3'-end on IRES 
folding reflects the combined contribution of CRE and 3'- 
UTR elements, since the presence of each of these 3' 
regions induces conformational changes that map to dif- 
ferent IRES nucleotides. These findings are reinforced by 
the fact that the structural rearrangements of the IRES 
induced by the 3'-end elements are functionally relevant 
in the context of translation and replication-active RNA 
transcripts. Therefore, though the IRES region acquires a 
functional conformation in the absence of additional 
RNA elements, as previously described (16,19), our 
results show that its structure can be further fine-tuned 
by the 3'-end of the HCV genome. This is in good agree- 
ment with previous observations demonstrating the func- 
tional role of the CRE and the 3'-UTR elements on IRES 
activity (33). To our knowledge, this is the first report 
describing complex, long-range RNA-RNA interactions 
between the 5'- and 3'-ends of HCV genome that influence 
the functional conformation of viral RNA elements. 

Three experimental strategies were followed to study the 
structure of the IRES element in the presence of different 
3'-tails: hybridization to antisense DNA oligonucleotide 
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microarrays, DMS chemical probing and SHAPE struc- 
tural analysis. These methodologies provide alternative 
and complementary information on the 3D structure of 
a target RNA molecule. Microarray analysis informs 
about the accessibihty of consecutive, overlapping 
stretches of nucleotides, being the main limitation of the 
method that it cannot provide single-nucleotide reso- 
lution. In turn, classical chemical probing methods, such 
as DMS, inform about the solvent accessibihty of the 
nucleobases at the residue level, although the data 
obtained are still limited since DMS does not recognize 
alternative conformational rearrangements that might 
affect the ribose-phosphate backbone without altering 
the base-pairing and stacking pattern. Finally, SHAPE 
structural analysis has minor but significant drawbacks 
that must be taken into account. It has been described 



that 2'-OH reactivity does not depend on solvent accessi- 
bihty and may be slightly influenced by several factors 
including the pK^ value of the surrounding nucleotides 
(62), the nucleobase identity (63) and, more importantly, 
the acquisition of alternative foldings in the RNA duplex 
distinct from the canonical A-form (64). This implies that 
all the conformationally dynamic residues can be detected 
by NMIA probing, although they remain solvent inaccess- 
ible. In this work, capillary electrophoresis followed by 
analysis using ShapeFinder software was used to detect 
not only NMIA modifications but also DMS signals. 
This experimental achievement facihtated comparisons 
between the results obtained using both types of probing 
methods. 

Analysis of the IRES folding in the construct lacking 
any 3'-tail (molecule I) using these methodologies revealed 
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O Decrease in relative NIVIIA reactivity in molecule Rep vs I 
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Figure 8. Impact of the non-structural proteins coding sequence on IRES folding. Summary of significant reactivity differences {P < 0.05) observed 
in DMS and SHAPE profiles for the replication-competent RNA molecule (Rep) with respect to transcript I. Filled circles: significant increase 
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reduction {P<Q.05) in DMS reactivity in Rep with respect to construct I; grey squares: noisy residues. 



that the global conformation of domains III and IV 
(Figures 2, 3 and 4) was consistent with the previously 
published results (14,17-20,55,65). Interestingly, struc- 
tural features of functionally relevant regions of the 
IRES element were evidenced by this study. For 
example, the microarray analysis reported high fluores- 
cence signal at the ohgonucleotide 176, which recognizes 
the stem of subdomain Illb (Figure 2 and Supplementary 
Figure SI). This observation was reinforced by the 
medium NMIA reactivity detected for this nucleotide 
stretch (Figure 4). In contrast, this region was barely 
susceptible to DMS modification (Figure 3 and 
Supplementary Figure S2). It should be noted that the 



subdomain Illb of HCV IRES contains a GC-rich stem 
interrupted by a variable internal loop and a CC mismatch, 
followed by an apical hairpin loop (17). These features 
remarkably determine its geometry and suggest that 
certain flexibihty might account at the backbone of this 
region while the base-pairing is preserved in the duplex, as 
detected by both previous NMR studies (17) and the DMS 
probing reported in this study (Figure 3). Therefore, our 
results evidence the usefulness of combining different 
methods to obtain a complete overview of the RNA 
folding in structurally complex regions. 

Experimental mapping of HCV IRES structure was 
initiated with RNA transcripts bearing distinct 3'-tails 
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that had been previously shown to exhibit differential 
translational efficiency (33). Significant changes in the 
NMIA and DMS reactivity patterns were detected in 
specific structural regions of the IRES domains 111 and 
IV, suggesting that local conformation rearrangements 
are tuned by the presence of CRE and/or 3'-UTR 
elements (Figures 2-5). Thus, in the presence of the 
whole 3'-end of the HCV genome (molecule ICU), a re- 
markable reduction in NMIA reactivity was detected at 
specific residues of the apical loop of subdomain llld. 
Also, DMS reactivity was reduced at nucleotides belong- 
ing to the essential internal E loop of this subdomain 
(Figures 3 and 5). NMR and molecular modehng studies 
have previously shown that the formation of a typical 
U-turn motif (27) at the apical loop could be related to 
the proper recruitment of the ribosomal subunit 40S 
(11,16,19,25,26). Significantly, the subdomain Illd is also 
a primary determinant in the acquisition of the functional 
folding of surrounding structural elements (19). Hence, it 
seems hkely that a decrease in the flexibihty of this 
subdomain upon interaction with the whole 3'-end of the 
viral genome (Figures 4 and 5) would affect the local 
structural environment. This is in good agreement with 
the concomitant increase detected in the microarray hy- 
bridization signal for the molecule ICU at oligonucleo- 
tides 295 and 316, mapping at the subdomain Ille and 
the PKl element, respectively (Figures 2 and 5). This 
region has been recently related to the formation of a 
double pseudoknot structure that controls the orientation 
of the start codon at the ribosomal P site (14). 
Importantly, these findings were confirmed using a short 
RNA transcript lacking the flue gene, what supports the 
specificity of the observed effect (Supplementary Figure S3 
and Supplementary Table S5). These IRES rearrange- 
ments triggered by the interaction with the CRE and 3'- 
UTR elements may intervene in translation initiation by 
affecting to the 48 S complex assembly, thus providing an 
explanation for the translational properties of the tran- 
script ICU (33). The results obtained with this transcript 
showed that the 3'-UTR enhancer effect on HCV IRES 
function can be further regulated by the CRE element. 

Important variations in the reactivity and hybridization 
profiles were also observed in the transcript ICU, 
compared to molecule 1, in the elF3 binding platform 
located at the major stem of domain 111 (Figures 2-5). It 
has been described that the binding of eIF3 is strictly de- 
pendent on the intact four-way helical junction defined by 
subdomains Illabc (18). In addition, the typical S-turn 
geometry of the internal loop of the subdomain 111b is 
also necessary for full IRES-dependent translational effi- 
ciency (17). The conformational variations throughout 
subdomain 111b mediated by the 3'-end could influence 
its proper folding required for the interaction with elF3 
(17). This provides additional clues to understand the 
singular translational properties of the ICU construct (33). 

Domain IV was also remarkably affected by the 
presence of the whole 3'-end of the viral genome, as 
derived from the NMIA reactivity pattern (Figures 4 
and 5). It has been described that the interplay between 
domains II and IV of the HCV IRES and the ribosomal 
subunit 40S seems to be essential for the proper unwinding 



of the stem and the further recognition of the AUG codon 
(65). Therefore, it is assumed that the stabihty of domain 
IV is inversely correlated with the IRES-dependent trans- 
lation efficiency (20). The significant gain observed in 
NMIA reactivity of the transcript ICU at numerous 
nucleotides of domain IV (Figures 4 and 5) points to an 
increase in the flexibihty of this domain. Therefore, in a 
biological context, the presence of the whole 3'-end of 
the viral genome could promote a destabilization of 
domain IV additional to that exerted by domain II, 
thus contributing to the proper positioning of the start 
codon. A further investigation of this interaction would 
provide important insights into the efficiency and regula- 
tion of the IRES-dependent translation, as previously 
suggested (33). 

The analysis performed on the synthetic construct ICU 
was extended to a replication-competent RNA transcript 
(Figures 6, 7 and 8). Significant reductions in DMS and 
NMIA reactivity were again measured at the apical and 
the internal loops of subdomain llld. Variations in the 
reactivity profile at the major stems of domain 111 were 
also detected. These results confirm the robustness of the 
present findings and show that the conformation of 
important functional regions of the IRES element is 
modulated by the 3'-end of the viral genome. Further, in 
the context of the transcript Rep, the IRES element shows 
changes in reactivity that affect a larger number of nucleo- 
tides than those previously mapped in the transcript ICU. 
In particular, changes in DMS and NMIA reactivity 
involving the short basal stem of domain 111 were specif- 
ically detected in Rep (Figures 6 and 7). This suggests the 
existence of complex and still unexplored interaction 
networks that involve additional distant regions located 
throughout the non-structural proteins coding region of 
the HCV genome. 

In summary, the present study shows that the 3D 
folding of domains 111 and IV of the IRES element is 
fine-tuned by RNA-RNA interactions involving the 
entire 3'-end of the HCV genome. Such long-range inter- 
actions occur in the absence of protein factors. This 
findings could have important consequences for a better 
understanding of IRES-dependent HCV translation, in 
agreement with previous observations demonstrating a 
regulatory role played by the entire viral 3'-end (33). 
Further studies will be required to clarify the mechanistic 
and functional details of the observed structural changes. 
Taken together, the reported observations, along with 
previous data, depict a more complex scenario for HCV 
IRES-dependent translation, in which both RNA-RNA 
and RNA-protein interactions cooperate to modulate 
viral protein synthesis. These findings might eventually 
be extended to other closely related, IRES-containing 
RNA viruses. Finally it is worth noting that our data ex- 
emphfy that the structure of even well characterized RNA 
elements, such as the HCV IRES element, may be refined 
when they are analyzed in a genomic context that includes 
additional RNA regions. Thus, complex interaction 
networks determined by long-distance, direct RNA- 
RNA contacts, could play important roles in the regula- 
tion of essential steps in the HCV cycle. 
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